Supplementary Material to Robust Structured Estimation with Single-Index Models

نویسندگان

  • Sheng Chen
  • Arindam Banerjee
چکیده

In this supplementary material, we present the deferred proofs of the results in the main paper. 1. Proof of Claim 1 Statement of Claim 1: Suppose that each element xi of x is sampled i.i.d. from Rademacher distribution, i.e., P(xi = 1) = P(xi = −1) = 0.5. Under model (3) with noise ε = 0, there exists a θ̄ ∈ Sp−1 together with a monotone f̄ , such that supp(θ̄) = supp(θ∗) and yi = f̄(⟨θ̄,xi⟩) for data {(xi, yi)}i=1 with arbitrarily large sample size n, while ∥θ̄ − θ∥2 > δ for some constant δ. Proof: In the noiseless setting with unknown f∗, provided that S , supp(θ∗) is given and |S| = s, the estimation of θ∗ is simplified as Find θS ∈ Ss−1 s.t. sign ( ⟨θS ,xiS − xjS⟩ ) = sign(yi − yj), (S.1) ∀ 1 ≤ i < j ≤ n , any of whose solution θ can be true θ∗ on the premise that no other information is available, since there always exists a monotone f satisfying f(⟨θ,xi⟩) = yi. Given the distribution of x, xiS − xjS only has 3 s possibilities even if n → +∞. We denote the feasible set of (S.1) by C, which is basically an intersection of Ss−1 and at most min{n(n− 1), 3} halfspaces (or hyperplanes if yi = yj). Depending on the 3 different values of each sign(yi − yj), this feasible set C has at most 3min{n(n−1),3p} possibilities, which is finite, and the union of them should be Ss−1. When s ≥ 2 and the constant δ is small enough, we can always find a C, in which there exist two different points away by δ. Specify them as θ∗S and θ̄S respectively, and Department of Computer Science & Engineering, University of Minnesota-Twin Cities, Minnesota, USA. Correspondence to: Sheng Chen , Arindam Banerjee . Proceedings of the 34 th International Conference on Machine Learning, Sydney, Australia, 2017. JMLR: W&CP.; Copyright 2017 by the author(s). we are unable to distinguish between them, as both can be solution to (S.1) for any samples. 2. Proof of Lemma 1 Statement of Lemma 1: Suppose the distribution of y in model (1) depends on x through ⟨θ∗,x⟩ and we define accordingly bi (z1, . . . , zm;θ ∗) = (S.2) E [qi (y1, . . . , ym) |⟨θ,x1⟩ = z1, . . . , ⟨θ,xm⟩ = zm] , With x being standard Gaussian N (0, I), u defined in (4) satisfies E [u ((x1, y1), . . . , (xm, ym))] = βθ∗ , (S.3) where β = ∑m i=1 E[bi (g1, . . . , gm;θ) · gi], and g1, . . . , gm are i.i.d. standard Gaussian. Proof: Let θ⊥ be any vector orthogonal to θ∗. For convenience, we use the shorthand notation u for u ((x1, y1), . . . , (xm, ym)). Then we have

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simultaneous robust estimation of multi-response surfaces in the presence of outliers

A robust approach should be considered when estimating regression coefficients in multi-response problems. Many models are derived from the least squares method. Because the presence of outlier data is unavoidable in most real cases and because the least squares method is sensitive to these types of points, robust regression approaches appear to be a more reliable and suitable method for addres...

متن کامل

Robust Structured Estimation with Single-Index Models

In this paper, we investigate general single-index models (SIMs) in high dimensions. Based on U -statistics, we propose two types of robust estimators for the recovery of model parameters, which can be viewed as generalizations of several existing algorithms for one-bit compressed sensing (1-bit CS). With minimal assumption on noise, the statistical guarantees are established for the generalize...

متن کامل

Robust Estimation in Linear Regression with Molticollinearity and Sparse Models

‎One of the factors affecting the statistical analysis of the data is the presence of outliers‎. ‎The methods which are not affected by the outliers are called robust methods‎. ‎Robust regression methods are robust estimation methods of regression model parameters in the presence of outliers‎. ‎Besides outliers‎, ‎the linear dependency of regressor variables‎, ‎which is called multicollinearity...

متن کامل

Application of Response Surface Method to the Estimation of Bond Work Index

Introduction The main objective of this contribution is to focus on the portion of the comminution process which deals with the prediction of the energy consumption due to the comminution portion of the milling processes. The comminution energy in mineral processing and cement industry is usually determined by empirical Bond Work Index (BWI), regardless of the mechanical properties of a rock....

متن کامل

Doubly robust estimation and causal inference in longitudinal studies with dropout and truncation by death: Supplementary material

Doubly robust estimation and causal inference in longitudinal studies with dropout and truncation by death: Supplementary material MICHELLE SHARDELL∗,1, GREGORY E HICKS, LUIGI FERRUCCI Department of Epidemiology and Public Health, University of Maryland 660 West Redwood Street Baltimore, Maryland 21201, U.S.A. Department of Physical Therapy, University of Delaware 303 McKinly Lab Newark, Delawa...

متن کامل

Supplementary Material for “Statistical test for consistent estimation of causal effects in linear non-Gaussian models”

This document contains supplementary material to the article ‘Statistical test for consistent estimation of causal effects in linear non-Gaussian models’, AISTATS 2012. A table of contents is given below.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017